19 research outputs found
Tile2Vec: Unsupervised representation learning for spatially distributed data
Geospatial analysis lacks methods like the word vector representations and
pre-trained networks that significantly boost performance across a wide range
of natural language and computer vision tasks. To fill this gap, we introduce
Tile2Vec, an unsupervised representation learning algorithm that extends the
distributional hypothesis from natural language -- words appearing in similar
contexts tend to have similar meanings -- to spatially distributed data. We
demonstrate empirically that Tile2Vec learns semantically meaningful
representations on three datasets. Our learned representations significantly
improve performance in downstream classification tasks and, similar to word
vectors, visual analogies can be obtained via simple arithmetic in the latent
space.Comment: 8 pages, 4 figures in main text; 9 pages, 11 figures in appendi
Understanding the Requirements for Surveys to Support Satellite-Based Crop Type Mapping: Evidence from Sub-Saharan Africa
This paper provides recommendations on how large-scale household surveys should be conducted to generate the data needed to train models for satellite-based crop type mapping in smallholder farming systems. The analysis focuses on maize cultivation in Malawi and Ethiopia, and leverages rich, georeferenced plot-level data from national household surveys that were conducted in 2018–20 and integrated with Sentinel-2 satellite imagery and complementary geospatial data. To identify the approach to survey data collection that yields optimal data for training remote sensing models, 26,250 in silico experiments are simulated within a machine learning framework. The best model is then applied to map seasonal maize cultivation from 2016 to 2019 at 10-m resolution in both countries. The analysis reveals that smallholder plots with maize cultivation can be identified with up to 75% accuracy. Collecting full plot boundaries or complete plot corner points provides the best quality of information for model training. Classification performance peaks with slightly less than 60% of the training data. Seemingly little erosion in accuracy under less preferable approaches to georeferencing plots results in the total area under maize cultivation being overestimated by 0.16–0.47 million hectares (8–24%) in Malawi
Recommended from our members
Topographic Controls on Southern California Ecosystem Function and Post-fire Recovery: a Satellite and Near-surface Remote Sensing Approach
Southern Californian wildfires can influence climate in a variety of ways, including changes in surface albedo, emission of greenhouse gases and aerosols, and the production of tropospheric ozone. Ecosystem post-fire recovery plays a key role in determining the strength, duration, and relative importance of these climate forcing agents. Southern California's ecosystems vary markedly with topography, creating sharp transitions with elevation, aspect, and slope. Little is known about the ways topography influences ecosystem properties and function, particularly in the context of post-fire recovery. We combined images from the USGS satellite Landsat 5 with flux tower measurements to analyze pre- and post-fire albedo and carbon exchanged by Southern California's ecosystems in the Santa Ana Mountains. We reduced the sources of external variability in Landsat images using several correction methods for topographic and bidirectional effects. We used time series of corrected images to infer the Net Ecosystem Exchange and surface albedo, and calculated the radiative forcing due to CO2 emissions and albedo changes. We analyzed the patterns of recovery and radiative forcing on north- and south-facing slopes, stratified by vegetation classes including grassland, coastal sage scrub, chaparral, and evergreen oak forest. We found that topography strongly influenced post-fire recovery and radiative forcing. Field observations are often limited by the difficulty of collecting ground validation data. Current instrumentation networks do not provide adequate spatial resolution for landscape-level analysis. The deployment of consumer-market technology could reduce the cost of near-surface measurements, allowing the installation of finer-scale instrument networks. We tested the performance of the Microsoft Kinect sensor for measuring vegetation structure. We used Kinect to acquire 3D vegetation point clouds in the field, and used these data to compute plant height, crown diameter, and volume. We found good agreement between Kinect-derived and manual measurements
Corrigendum: Satellite detection of cover crops and their effects on crop yield in the Midwestern United States (2018 Environ. Res. Let. 13 064033)
The original raw dataset used to generate this work contained a number of duplicate entries—roughly 7% of the total farm fields. The substantive majority of these were from one large farm that had conducted their operations in a way that caused duplication as a side effect in our data generation process. Unfortunately, as the error was in the raw dataset, its correction required a re-run of the entire data pipeline, resulting in numerous small downstream changes. With respect to the most important numbers, the accuracy of the classifier went down slightly from 91.5% to 91.2% measured in absolute terms but increased from 0.68 to 0.74 measured by kappa. The trend in cover cropped acres grew slightly stronger, and the yield effects in maize and soybean moved from 0.65% to 0.71% and 0.35% to 0.29% respectively. None of the overall conclusions of the work have materially changed. Below, we provide all changes to the applicable sections of the original manuscript in bold underscore (or strikethrough) where applicable, in addition to modified versions of the corresponding figures and supplementary materials
Understanding the Requirements for Surveys to Support Satellite-Based Crop Type Mapping: Evidence from Sub-Saharan Africa
This paper provides recommendations on how large-scale household surveys should be conducted to generate the data needed to train models for satellite-based crop type mapping in smallholder farming systems. The analysis focuses on maize cultivation in Malawi and Ethiopia, and leverages rich, georeferenced plot-level data from national household surveys that were conducted in 2018–20 and integrated with Sentinel-2 satellite imagery and complementary geospatial data. To identify the approach to survey data collection that yields optimal data for training remote sensing models, 26,250 in silico experiments are simulated within a machine learning framework. The best model is then applied to map seasonal maize cultivation from 2016 to 2019 at 10-m resolution in both countries. The analysis reveals that smallholder plots with maize cultivation can be identified with up to 75% accuracy. Collecting full plot boundaries or complete plot corner points provides the best quality of information for model training. Classification performance peaks with slightly less than 60% of the training data. Seemingly little erosion in accuracy under less preferable approaches to georeferencing plots results in the total area under maize cultivation being overestimated by 0.16–0.47 million hectares (8–24%) in Malawi
Mapping Smallholder Yield Heterogeneity at Multiple Scales in Eastern Africa
Accurate measurements of crop production in smallholder farming systems are critical to the understanding of yield constraints and, thus, setting the appropriate agronomic investments and policies for improving food security and reducing poverty. Nevertheless, mapping the yields of smallholder farms is challenging because of factors such as small field sizes and heterogeneous landscapes. Recent advances in fine-resolution satellite sensors offer promise for monitoring and characterizing the production of smallholder farms. In this study, we investigated the utility of different sensors, including the commercial Skysat and RapidEye satellites and the publicly accessible Sentinel-2, for tracking smallholder maize yield variation throughout a ~40,000 km2 western Kenya region. We tested the potential of two types of multiple regression models for predicting yield: (i) a “calibrated model”, which required ground-measured yield and weather data for calibration, and (ii) an “uncalibrated model”, which used a process-based crop model to generate daily vegetation index and end-of-season biomass and/or yield as pseudo training samples. Model performance was evaluated at the field, division, and district scales using a combination of farmer surveys and crop cuts across thousands of smallholder plots in western Kenya. Results show that the “calibrated” approach captured a significant fraction (R2 between 0.3 and 0.6) of yield variations at aggregated administrative units (e.g., districts and divisions), while the “uncalibrated” approach performed only slightly worse. For both approaches, we found that predictions using the MERIS Terrestrial Chlorophyll Index (MTCI), which included the red edge band available in RapidEye and Sentinel-2, were superior to those made using other commonly used vegetation indices. We also found that multiple refinements to the crop simulation procedures led to improvements in the “uncalibrated” approach. We identified the prevalence of small field sizes, intercropping management, and cloudy satellite images as major challenges to improve the model performance. Overall, this study suggested that high-resolution satellite imagery can be used to map yields of smallholder farming systems, and the methodology presented in this study could serve as a good foundation for other smallholder farming systems in the world